NSF PAR Search | NSF Public Access Repository


ANT: Adapt Network Across Time for Efficient Video Processing

https://doi.org/10.1109/CVPRW56347.2022.00293

Liang, Feng; Chin, Ting-Wu; Zhou, Yang; Marculescu, Diana (June 2022, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW))

Full Text Available
QUIDAM: A Framework for Quantization-Aware DNN Accelerator and Model Co-Exploration

https://doi.org/10.1145/3555807

Inci, Ahmet; Virupaksha, Siri Garudanagiri; Jain, Aman; Chin, Ting-Wu; Thallam, Venkata Vivek; Ding, Ruizhou; Marculescu, Diana (September 2022, ACM Transactions on Embedded Computing Systems)

As the machine learning and systems communities strive to achieve higher energy efficiency through custom deep neural network (DNN) accelerators, varied precision or quantization levels, and model compression techniques, there is a need for design space exploration frameworks that incorporate quantization-aware processing elements into the accelerator design space while having accurate and fast power, performance, and area models. In this work, we present QUIDAM, a highly parameterized quantization-aware DNN accelerator and model co-exploration framework. Our framework can facilitate future research on design space exploration of DNN accelerators for various design choices such as bit precision, processing element type, scratchpad sizes of processing elements, global buffer size, number of total processing elements, and DNN configurations. Our results show that different bit precisions and processing element types lead to significant differences in terms of performance per area and energy. Specifically, our framework identifies a wide range of design points where performance per area and energy vary by more than 5× and 35×, respectively. With the proposed framework, we show that lightweight processing elements achieve on-par accuracy results and up to 5.7× more performance per area and energy improvement when compared to the best INT16-based implementation. Finally, due to the efficiency of the pre-characterized power, performance, and area models, QUIDAM can speed up the design exploration process by 3-4 orders of magnitude as it removes the need for expensive synthesis and characterization of each design.
Full Text Available
Renofeation: A Simple Transfer Learning Method for Improved Adversarial Robustness

https://doi.org/10.1109/CVPRW53098.2021.00362

Chin, Ting-Wu; Zhang, Cha; Marculescu, Diana (June 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW))
Full Text Available
Width transfer: on the (in)variance of width optimization

https://doi.org/10.1109/CVPRW53098.2021.00334

Chin, Ting-Wu; Marculescu, Diana; Morcos, Ari S. (June 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW))
Full Text Available
One Weight Bitwidth to Rule Them All

Chin, Ting-Wu; Chuang, Pierce; Chandra, Vikas; Marculescu, Diana (August 2020, European Conference on Computer Vision Workshops)

Weight quantization for deep ConvNets has shown promising results for applications such as image classification and semantic segmentation and is especially important for applications where memory storage is limited. However, when aiming for quantization without accuracy degradation, different tasks may end up with different bitwidths. This creates complexity for software and hardware support and the complexity accumulates when one considers mixed-precision quantization, in which case each layer’s weights use a different bitwidth. Our key insight is that optimizing for the least bitwidth subject to no accuracy degradation is not necessarily an optimal strategy. This is because one cannot decide optimality between two bitwidths if one has smaller model size while the other has better accuracy. In this work, we take the first step to understand if some weight bitwidth is better than others by aligning all to the same model size using a width-multiplier. Under this setting, somewhat surprisingly, we show that using a single bitwidth for the whole network can achieve better accuracy compared to mixed-precision quantization targeting zero accuracy degradation when both have the same model size. In particular, our results suggest that when the number of channels becomes a target hyperparameter, a single weight bitwidth throughout the network shows superior results for model compression.
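The idea of aligning different bitwidths to the same model size with a width-multiplier can be illustrated with a minimal sketch. This is not the authors' code: the function names, the toy chain of 3x3 convolutions, the choice to keep the input channels fixed, and the binary-search bounds are all assumptions made for illustration. Model size is taken here as the number of weights times the weight bitwidth.

```python
def conv_params(channels, kernel=3):
    """Weight count for a plain chain of conv layers (hypothetical toy model)."""
    return sum(c_in * c_out * kernel * kernel
               for c_in, c_out in zip(channels[:-1], channels[1:]))


def model_size_bits(channels, bitwidth, kernel=3):
    """Model size in bits: number of weights times the weight bitwidth."""
    return conv_params(channels, kernel) * bitwidth


def scaled_channels(base_channels, multiplier):
    """Scale every layer width except the input channels (assumed fixed, e.g. RGB)."""
    return [base_channels[0]] + [max(1, round(c * multiplier))
                                 for c in base_channels[1:]]


def width_multiplier_for(base_channels, ref_bitwidth, bitwidth,
                         lo=0.1, hi=16.0, iters=60):
    """Largest multiplier in [lo, hi] whose model size at `bitwidth` does not
    exceed the size of the unscaled model at `ref_bitwidth`.

    Assumes the target lies between the sizes at `lo` and `hi`; model size is
    non-decreasing in the multiplier, so a binary search is sufficient.
    """
    target = model_size_bits(base_channels, ref_bitwidth)
    for _ in range(iters):
        mid = (lo + hi) / 2
        size = model_size_bits(scaled_channels(base_channels, mid), bitwidth)
        if size <= target:
            lo = mid
        else:
            hi = mid
    return lo


if __name__ == "__main__":
    base = [3, 64, 128, 256]  # hypothetical channel widths
    for bits in (8, 4, 2, 1):
        m = width_multiplier_for(base, ref_bitwidth=8, bitwidth=bits)
        size = model_size_bits(scaled_channels(base, m), bits)
        print(f"{bits}-bit weights: multiplier {m:.3f}, size {size} bits")
```

Because the weight count of inner layers grows roughly quadratically with the width multiplier, halving the bitwidth allows the network to be widened by roughly a factor of the square root of two at equal size. Under this kind of alignment, networks with different bitwidths can be compared on accuracy alone.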
Full Text Available
AdaScale: Towards Real-time Video Object Detection Using Adaptive Scaling

Chin, Ting-Wu; Ding, Ruizhou; Marculescu, Diana (April 2019, Systems and Machine Learning Conference)

Full Text Available
Regularizing Activation Distribution for Training Binarized Deep Networks

Ding, Ruizhou; Chin, Ting-Wu; Liu, Zeye; Marculescu, Diana (June 2019, IEEE Conference on Computer Vision and Pattern Recognition)

Full Text Available
FLightNNs: Lightweight Quantized Deep Neural Networks for Fast and Accurate Inference

https://doi.org/10.1145/3316781.3317828

Ding, Ruizhou; Liu, Zeye; Chin, Ting-Wu; Marculescu, Diana; Blanton, R. D. (June 2019, ACM/IEEE Design Automation Conference)

Full Text Available
Understanding the Impact of Label Granularity on CNN-Based Image Classification

https://doi.org/10.1109/ICDMW.2018.00131

Chen, Zhuo; Ding, Ruizhou; Chin, Ting-Wu; Marculescu, Diana (November 2018, IEEE International Conference on Data Mining Workshops)

Full Text Available
Designing adaptive neural networks for energy-constrained image classification

https://doi.org/10.1145/3240765.3240796

Stamoulis, Dimitrios; Chin, Ting-Wu; Prakash, Anand Krishnan; Fang, Haocheng; Sajja, Sribhuvan; Bognar, Mitchell; Marculescu, Diana (November 2018, IEEE/ACM International Conference on Computer-Aided Design)

Full Text Available
